An Ontological Framework for Retrieving Environmental Sounds Using Semantics and Acoustic Content
نویسندگان
چکیده
Organizing a database of user-contributed environmental sound recordings allows sound files to be linked not only by the semantic tags and labels applied to them, but also to other sounds with similar acoustic characteristics. Of paramount importance in navigating these databases are the problems of retrieving similar sounds using textor sound-based queries, and automatically annotating unlabeled sounds. We propose an integrated system, which can be used for text-based retrieval of unlabeled audio, content-based query-by-example, and automatic annotation of unlabeled sound files. To this end, we introduce an ontological framework where sounds are connected to each other based on the similarity between acoustic features specifically adapted to environmental sounds, while semantic tags and sounds are connected through link weights that are optimized based on userprovided tags. Furthermore, tags are linked to each other through a measure of semantic similarity, which allows for efficient incorporation of out-of-vocabulary tags, that is, tags that do not yet exist in the database. Results on two freely available databases of environmental sounds contributed and labeled by nonexpert users demonstrate effective recall, precision, and average precision scores for both the text-based retrieval and annotation tasks.
منابع مشابه
Vibrotactile Identification of Signal-Processed Sounds from Environmental Events Presented by a Portable Vibrator: A Laboratory Study
Objectives: To evaluate different signal-processing algorithms for tactile identification of environmental sounds in a monitoring aid for the deafblind. Two men and three women, sensorineurally deaf or profoundly hearing impaired with experience of vibratory experiments, age 22-36 years. Methods: A closed set of 45 representative environmental sounds were processed using two transposing (TRH...
متن کاملShortest Path Techniques for Annotation and Retrieval of Environmental Sounds
Many techniques for text-based retrieval and automatic annotation of music and sound effects rely on learning with explicit generalization, training individual classifiers for each tag. Non-parametric approaches, where queries are individually compared to training instances, can provide added flexibility, both in terms of robustness to shifts in database content and support for foreign queries,...
متن کاملA Framework for Business Intelligence Application using Ontological Classification
Every business needs knowledge about their competitors to survive better. One of the information repositories is web. Retrieving Specific information from the web is challenging. An Ontological model is developed to capture specific information by using web semantics. From the Ontology model, the relations between the data are mined using decision tree. From all these a new framework is develop...
متن کاملStoring and Retrieving Software Components: A Component Description Manager
The aim of the paper is to present the results of research into Component-Based software development by providing a specification mechanism allowing searching for components in a component repository. A new component classification framework is proposed based on which a Component Description Manager has been designed and implemented. The classification framework combines domain knowledge, ontol...
متن کاملAn Ontological Approach to the specification of Semantics for Learning Content. The convergence of knowledge management and technology enhanced learning
The PhD Thesis is concentrating in the convergence of knowledge management and technology enhanced learning towards the effectiveness in the design and exploitation of learning content. The main emphasis is paid to the modelling of the learning content development process and through ontological considerations the thesis contributes in theory and practice as follows: It proposes a Life Cycle mo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- EURASIP J. Audio, Speech and Music Processing
دوره 2010 شماره
صفحات -
تاریخ انتشار 2010